Back

Molecular Genetics and Genomics

Springer Science and Business Media LLC

Preprints posted in the last 90 days, ranked by how well they match Molecular Genetics and Genomics's content profile, based on 11 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.

1
MagNet: Computational Methods for Constructing High-Confidence Protein-Protein Interaction Networks in Magnaporthe oryzae

Kim, H.; Cheong, K.; Jeon, J.; Choi, G.; Koh, J.; Song, H.; Hue, Y.; Nam, Y.; Choi, B.; Lim, Y.-J.; Choi, J.; Kim, K.-T.; Lee, Y.-H.

2026-05-14 genomics 10.64898/2026.05.11.724438 medRxiv
Top 0.1%
2.1%
Show abstract

Magnaporthe oryzae, the rice blast fungus, plays a role as a model organism for molecular plant-microbe interaction research. Studies on the pathogenic mechanism of this fungus revealed many genes involved in signaling pathways. As multi-omics data are being available, genomic-level researches have been conducted to uncover the underlying biological processes during the pathogenesis of M. oryzae. Identifying the genome-wide protein-protein interaction (PPI) network is one of the omics-level approaches, which helps to understand signaling and regulatory pathways. However, existing biological network resources of M. oryzae are not sufficient to decipher pathogenesis mechanisms due to the abundance of false positives/negatives. In this study, a reliable PPI network database of M. oryzae, MagNet, was constructed with three methods, including homology-based Interolog search, co-expression network construction, and domain-domain interaction (DDI)-based prediction. With three approaches altogether, the pan-network with 5,600,976 interactions was generated, including 217,531 highly confident interactions supported by all three methods. Experimental data on M. oryzae PPIs supported that our PPI network can predict PPIs with higher accuracy compared to the previously constructed databases. MagNet would provide integrated biological network data, which can help to understand the molecular mechanisms of the rice blast fungus. The PPI data can be accessed via https:/magnet.scnu.ac.kr.

2
Recurrent LINE 1 exonization drives transcriptome remodelling in NSCLC

Parida, A. S.; Kumar, A.; Tiwari, B.

2026-04-24 genomics 10.64898/2026.04.22.720055 medRxiv
Top 0.1%
1.9%
Show abstract

The only autonomously active transposable elements in the human genome are Long interspersed nuclear element-1 (LINE-1) elements. These elements are known to play an important role in changing the transcriptome. LINE-1 sequences affect gene regulation during post-transcription processing, along with their established role in retrotransposition. Exonization is one mechanism where the LINE-1 integrated genome undergoes alternative splicing to produce new isoforms of transcripts. Our work mainly highlights the effect of LINE-1 associated exonization, focusing on the formation of isoforms of transcripts. Using Non-small cell lung cancer (NSCLC) as a model, we conducted a detailed transcriptome study that combines splice junction profiling with gene expression data. Our results show that LINE-1 sequences are often included as exons in host transcripts, leading to the formation of new exons and their various isoforms. The events are validated by solid splice junction evidence that proves the reliability and reproducibility. In particular, it was observed that repetitive analyses revealed certain LINE-1 exonization events that were consistent. The finding indicates that LINE-1 act as recurrent sources of splice ready sequences. Though exonizations do not necessarily affect the total expression levels of genes, our study reveals that they certainly contribute to transcript diversity. The diversity of isoforms generated potentially contributes to the effects of gene function. This study is limited to NSCLC, but it is likely that the exonizations events play a crucial role in the altering RNA diversity in cancers. Therefore the study elucidates new insights into how transposable elements modify gene structure and function during cancer development.

3
Clarified an rDNA Gene Unit Pattern with (CTTT)n and (CT)n Microsatellites Aggregation Ahead of and Behind the Gene in Human Genome

Shen, J.; Tang, S.; Xia, Y.; Qin, J.; Xu, H.; Tan, Z.

2026-03-24 genetics 10.64898/2026.03.22.713381 medRxiv
Top 0.1%
1.7%
Show abstract

BackgroundConventional models of human ribosomal DNA (rDNA) array organization have historically depended on transcription-centric boundaries, partitioning the unit into a [~]13 kb rDNA transcription region and a monolithic [~]31 kb intergenic spacer (IGS). While our previous identification of Duplication Segment Units (DSUs) mapped these arrays based on an intuitive analysis of the microsatellite density landscape of the complete reference human genome, our present deep mining of this landscape has revealed a more accurate rDNA Gene Unit Pattern. Methods & ResultsIn this study, we conducted a deep mining analysis of our previously established microsatellite density landscape of the T2T-CHM13 assembly, focusing specifically on nucleolar organizing regions (NORs). We suggest a more accurate rDNA Gene Unit Pattern containing a (CTTT)n microsatellite aggregation ahead of the rDNA gene and a (CT)n microsatellite aggregation behind the gene, rather than a pattern featuring an IGS region inserted between two rDNA genes. ConclusionsA correct rDNA gene pattern of the human genome probably includes a (CTTT)n microsatellite aggregation ahead of the gene and a (CT)n microsatellite aggregation behind it, which possibly constitute cis- and trans-regulating regions; the (CTTT)n and (CT)n microsatellite aggregations may provide two different local stable DNA structures for regulatory protein binding.

4
Gene model for the ortholog of Lst8 in Drosophila yakuba

Lawson, M. E.; Sanow, K. A.; Chetana, K.; Taylor, E.; Morgan, A.; Flannery, D.; Elsie, C.; Rele, C. P.; Reed, L. K.; O'Rourke, K. S.

2026-05-14 genomics 10.64898/2026.05.12.723325 medRxiv
Top 0.2%
1.3%
Show abstract

Gene model for the ortholog of Lst8 (Lst8) in the May 2011 (WUGSC dyak_caf1/DyakCAF1) Genome Assembly (GenBank Accession: GCA_000005975.1) of Drosophila yakuba. This ortholog was characterized as part of a developing dataset to study the evolution of the Insulin/insulin-like growth factor signaling pathway (IIS) across the genus Drosophila using the Genomics Education Partnership gene annotation protocol for Course-based Undergraduate Research Experiences.

5
A bioluminescence resonance energy transfer (BRET) assay to detect telomere length in S. cerevisiae

Richter, F.; Ropiak, H. M.; Urban, J.; Franke, J.

2026-03-13 genomics 10.64898/2026.03.11.711003 medRxiv
Top 0.2%
1.3%
Show abstract

A method to measure telomere length in S. cerevisiae was developed based on bioluminescence resonance energy transfer (BRET). The system uses energy transfer between a luciferase-Rif2 fusion protein and fluorescently tagged Rap1. The study demonstrates that the BRET ratio correlates with the Rap1/Rif2 complex at the telomeres and thus the availability of telomeric Rap1 binding sites. This enables the measurement of telomere length in living cells. The system was able to reproduce reported deviations in telomere length in mutants lacking telomere length regulators, cells treated with telomere length modifying compounds and strains expressing inducible telomerase. The BRET ratio linearly correlated with the average number of telomeric nucleotides derived from long-read sequencing data using a novel algorithm for telomere length calculation. GRAPHICAL ABSTRACT O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=200 SRC="FIGDIR/small/711003v1_ufig1.gif" ALT="Figure 1"> View larger version (39K): org.highwire.dtl.DTLVardef@1850c4dorg.highwire.dtl.DTLVardef@1ead295org.highwire.dtl.DTLVardef@1a76358org.highwire.dtl.DTLVardef@6b3183_HPS_FORMAT_FIGEXP M_FIG C_FIG

6
Integrative Identification and Characterization of PCOS-Associated lncRNAs From the Interface of Genetic Association, Transcriptomics, and Gene Structure Evolution

He, Z.; Li, Y.; Shkurat, T. P.; Butenko, E. V.; Derevyanchuk, E. G.; Lomteva, S. V.; Chen, L.; Lipovich, L.

2026-04-02 genomics 10.64898/2026.03.31.715548 medRxiv
Top 0.3%
0.9%
Show abstract

BackgroundPolycystic ovary syndrome (PCOS) is a prevalent endocrine disorder and a leading cause of female infertility, with complex genetic, metabolic, and hormonal etiologies. Long non-coding RNAs (lncRNAs) have emerged as important regulators of diverse biological processes, yet their roles in PCOS remain underexplored. Here, we identified and characterized PCOS differentially expressed gene-associated lncRNAs (PDEGAL) with an integrative approach combining expression data, genetic association, and evolutionary analysis. MethodsThirty-three PCOS-associated protein-coding genes were obtained from our prior study, and all their nearby and overlapping lncRNAs were annotated. These candidates were analyzed using UCSC Genome Browser-mapped annotations and datasets, including NCBI RefSeq, GENCODE, GTEx, GWAS SNPs, and conservation, as well as the FANTOM5 cap analysis of gene expression (CAGE) promoter data, to assess their expression, regulatory potential, genetic variant overlaps, and evolutionary conservation. ResultsTwenty-three PDEGALs (18 antisense to, and 5 sharing bidirectional promoters with, known PCOS-associated protein-coding genes) were identified. 17 PDEGALs contained GWAS SNPs with statistically significant disease associations, 9 of which were associated with PCOS-related traits. 5 PDEGALs demonstrated expression in the KGN granulosa cell model of PCOS. Key gene structure element (KGSE) analysis revealed that most PDEGALs are primate-specific. Integrating four criteria--GTEx expression, GWAS SNPs, FANTOM promoterome, and KGSE conservation--highlighted HELLPAR as the only lncRNA fulfilling all four, while five others--PGR-AS1, MTOR-AS1, ENSG00000265179, ENSG00000256218, and LOC105377276--fulfilled three of the four criteria. ConclusionsWe have systematically identified candidate PCOS regulatory lncRNAs with convergent genetic, expression, and evolutionary evidence. These results provide a framework for functional validation and highlight lncRNAs as potential biomarkers and therapeutic targets in PCOS that function by regulating their nearby and overlapping protein-coding genes.

7
Exploratory 16S rRNA Metagenomic Analysis of Soil Microbial Communities in Agroecosystems of North-Central Argentina

Palma, L.; Guzman, A. L.; Marozzi, A.; Del Valle, E. E.; Castoldi, L.

2026-04-01 genetics 10.64898/2026.03.31.715494 medRxiv
Top 0.3%
0.8%
Show abstract

Agriculture has modified the soil structure due to the influence of external factors and processes that affect microbial biodiversity. Metagenomics is a fundamental tool for the study of soil microbial diversity because it provides information about the ecosystem diversity, including both the microorganisms that cannot be isolated in culture media and those that are no longer viable in the analyzed sample. In this work, six soil samples obtained from agroecosystems of central and northern Argentina were subjected to a preliminary 16S metagenomic analysis. Copiotrophic bacteria (Proteobacteria and Actinobacteria) were dominant and one of the samples had a dominance of an oligotrophic Phylum (Acidobacteria). Our findings support previous evidence from traditionally managed agroecosystems and provide new insights into the diversity of soil microbiomes in Argentine regions outside the Pampas. Finally, we analyzed the most common genera with relevant species to agronomy, both beneficial and pathogenic, and their abundance and diversity in the sequenced samples.

8
Position of de novo purine biosynthesis gene disruptions shapes purine-starvation phenotypes in Saccharomyces cerevisiae

Ozolina, Z.; Kokina, A.; Zile, A.; Pleiko, K.; Auzins, E. T.; Kristjuhan, A.; Liepins, J.

2026-03-05 microbiology 10.64898/2026.03.04.709599 medRxiv
Top 0.3%
0.8%
Show abstract

Purine moieties are essential for many functions within the eukaryotic cell, including energy, signaling and nucleic acid synthesis. While purine starvation is known to induce stress resistance in eukaryotic model organism budding yeast Saccharomyces cerevisiae, it remains unclear whether the physiological response is related to disruption of synthesis pathway in particular position or it is uniform across all genetic deficiencies within the de novo adenine biosynthesis pathway. It is also not known how purine starved cells perceive purine shortage - weather they share the same signaling elements with nitrogen starvation or not. MethodsWe characterised physiology of strains with deletions in adenine biosynthesis pathway when cultivated in full or purine deficient and compared to cell physiological parameters when cultivated in nitrogen deficient media. We tested stress tolerance, carbon flux, cell cycle arrest and did transcription profiling (RNA-seq). ResultsOur findings demonstrate that purine starvation-induced stress resistance is significantly modulated by the specific step at which the pathway is interrupted. Transcriptional analysis revealed that purine starvation in many aspects phenocopies nitrogen starvation, particularly - in both starvations strong downregulation of ribosome related genes occurs. In the same time several metabolic features which differ from N- and ade- starvations: pentose phosphate pathway is specifically upregulated within ade4{Delta}-ade2{Delta} and downregulated in N-cells. Notably, the expression of stress-responsive genes such as HSP12, HSP26, and GRE1 varied between mutants, suggesting that the accumulation of pathway intermediates (e.g., AIR in ade2{Delta}) or the absence of downstream precursors (AICAR) alters the perception of starvation especially in the case of ade16{Delta}ade17{Delta} strain. ConclusionsMetabolic and stress-tolerance phenotypes of purine auxotrophs are not merely a result of purine depletion but seems that the response is signalled via the same pathways, like TOR1. The results suggest that strains having mutations within various positions of the purine pathway "perceive" purine limitation a bit differently - especially when we compare the end of the pathway with the other mutants. Different phenotypic outcomes of the occasional purine depletion might give preferences for organisms which have mutations in the beginning rather at the end of the pathway. Besides, our findings might have implications in the design of synthetic pathways and the use of auxotrophic markers in yeast research.

9
In silico restriction site analysis of whole genome sequences shows patterns caused by selection and sequence duplications

Vedder, L.; Schoof, H.

2026-05-16 genomics 10.64898/2026.05.15.725336 medRxiv
Top 0.4%
0.8%
Show abstract

Biological sequences are known to be not random. Thus, the comparison of in silico restriction fragment distributions of random and biological sequences may be an indicator of this non-randomness. Our analyses show that for most of the tested combinations of restriction enzyme and genome sequence the fragments per Megabase of the biological sequence deviate at least more then 10% from the corresponding random sequence. This deviation goes into both directions, i.e. clearly increased values are as common as clearly decreased values. Although there is no species- or restriction-enzyme-specific effect, a clear impact of the GC content both of the restriction site and of the genome sequence can be seen. In contrast to the random sequences, the genome sequences show distinct peaks in their fragment length distributions, hinting to repetitive elements such as transposons.

10
The multidimensional structure of wellbeing: genetic evidence from a multivariate twin study including the Mental Health Continuum

Azcona Granada, N.; Geijsen, A.; de Vries, L. P.; Pelt, D.; Bartels, M.

2026-03-30 genetics 10.64898/2026.03.27.714768 medRxiv
Top 0.4%
0.7%
Show abstract

Wellbeing is commonly defined as the combination of feeling good and functioning well and typically conceptualized as two related but distinct components. Hedonic wellbeing emphasizes pleasure, happiness, and life satisfaction, while eudaimonic wellbeing focuses on meaning, personal growth, flourishing, and the realization of ones potential. The Mental Health Continuum-Short Form was developed as a comprehensive measure of wellbeing and includes three subscales assessing emotional, social, and psychological wellbeing. Although the Mental Health Continuum total score is often interpreted as an indicator of overall wellbeing, the underlying genetic structure of its three subscales and its genetic overlap with other commonly used wellbeing measures remains unclear. Using data from 5,212 individuals from the Netherlands Twin Register (72% female, mean age 36.4), we fitted multivariate twin models to examine the genetic architecture of the Mental Health Continuum and its associations with other wellbeing measures (quality of life, life satisfaction, subjective happiness, and flourishing). Results indicate that, at the genetic level, the Mental Health Continuum is best explained by its three distinct subscales rather than by a latent factor. When considering the Mental Health Continuum together with the other wellbeing measures, we found moderate to high genetic correlations (r = 0.52 - 0.83), indicating substantial overlap in the genetics underlying the wellbeing constructs. However, we did not find evidence for a single common genetic factor underlying all constructs. These findings highlight the multidimensional structure of wellbeing, but the moderate to high genetic correlations across measures suggest that it is important to align the level of measurement (phenotypic vs genetic) with the research question.

11
Environmental Regulation and Gene-by-Environment Interaction Influence RAP1 Activity and its Impact on Gene Expression

Kalra, S.; Sanchez, G.; Stubin, A.; Le, A.; Bakshian, A.; Ortiz Diaz, B.; Mark, B. M.; Pena, C.; Parker, E.; Johnston, E.; Hsu, E.; Brangham, G.; Bala-Mehta, I.; Perez, L.; Milrod, M.; Stanten, M.; Nakamura, M.; Hwang, P.; Ptaszynska, S.; Cander, S.; Park, S.; Tan, T. L.; Zhou, Y.; Coolon, J.

2026-05-09 genomics 10.64898/2026.05.06.723246 medRxiv
Top 0.4%
0.7%
Show abstract

Gene-by-environment (GxE) interactions play a major role in shaping both phenotypic and molecular variation, with important implications for human health and disease. In this study, we used the Doxycycline (Dox) regulated, tetracycline-responsive (Tet-Off) promoter system to sequentially reduce or titrate gene expression levels of the essential yeast transcription factor Repressor Activator Protein 1 (RAP1) similar to a hypomorph allele series, across three distinct environments: Yeast Peptone Dextrose (YPD) media, YPD media with Heat Shock (HS), and Yeast Peptone Acetate (YPAC) media. We then performed RNA sequencing (RNA Seq) to assess global transcriptional responses to RAP1 reduction in these different growth environments. Our analysis first focused on the independent effects of varying RAP1 expression levels within and across environments. We then explored GxE interactions, revealing a subset of genes with significant consequences of reduced levels of RAP1 and environment-specific expression patterns. Notably, many genes exhibited opposite effects of RAP1 titration on gene expression when yeast were grown in YPAC media compared to YPD media and/or HS, suggesting environment-dependent regulatory architecture. This design reveals how cells integrate internal transcriptional and regulatory changes with external environmental cues, providing a deeper view of GxE architecture. Using Weighted Gene Co-expression Network Analysis (WGCNA), we identified co-regulated gene modules, and by combining this with transcription factor motif enrichment tests, our study identified candidate regulators driving their dynamics. Our findings demonstrate that gene regulatory networks can vary dramatically depending on the environmental context an organism experiences, which can then influence the specific phenotypes produced by a particular genetic perturbation. This illustrates the complexity of genotype-environment interactions and the importance of studying gene function in multiple environments to gain a truly comprehensive understanding of a genes sometimes numerous and diverse functions.

12
Deep analysis of FANTOM CAGE data reveals hierarchical patterns of TSS co-deployment hubs and their disruption in cancers

Meduri, R.; Satish, A. L.; Singh, U.

2026-05-18 genomics 10.64898/2026.05.15.725323 medRxiv
Top 0.4%
0.7%
Show abstract

Selective deployment of multiple transcription start sites is a major regulatory feature of human transcriptomes. FANTOM CAGE data exhibit a near-universal TSS deployment parsimony which is disrupted in cancers. We have recently shown that TSS deployment is sensitive to gene function, futile upstream transcription, and cellular biosynthetic states. Patterns in FANTOM CAGE data can reveal mechanisms underlying TSS co-deployments. We propose and test the possibility that some TSSs act like epromoters and act as co-varying hubs of transcriptional activities for multiple other promoters. Using deep analysis of CAGE data implemented through neural networks we show that non-cancers implement transcription co-deployments through cores of epromoter-like TSSs which are generally proximal to their start codons. These TSSs show enhancer-like TFBSs profiles. A comparison with cancer CAGE data shows that the concentrated epromoter core is disrupted in cancers with multiple distal TSSs replacing the proximal TSS cores. We provide evidence that the core TSSs are rich in YY1 and CTCF binding sites and associated with genes coding for transcription factors. Our findings show that covariance of TSS deployment is sensitive to transcriptional resource cost and a parsimonic design of TSS co-deployments depends on proximal TSSs in non-cancers, a mechanism grossly disrupted in cancers. HighlightsO_LIHeterogeneous FANTOM CAGE data contains universal patterns of TSSs co-deployments. C_LIO_LITSS co-deployments exhibit a parsimonious "core-covariant" scheme which is disrupted in cancers. C_LIO_LICore TSSs are enriched in transcription factor binding sites and gene functions which justify biological features of the samples. C_LIO_LIThe DL pipeline we present identifies the core-covariant TSS sets in an unbiased manner. C_LI

13
Altered salivary miRNA profiles and microbiome composition correlate with psychosocial stress

Garbarino, S.; Magnavita, N.; Pardini, B.; Tarallo, S.; Cipriani, F.; Camandona, A.; Ferrero, G.; Scoditti, E.; Naccarati, A. G.

2026-03-23 molecular biology 10.64898/2026.03.20.713173 medRxiv
Top 0.4%
0.7%
Show abstract

Psychosocial stress is a significant risk factor for mental and physical illness, and emerging evidence suggests that altered oral microRNAs (miRNAs) and microbiome may act as biomarkers or mediators of stress responses. This study investigated stress-associated molecular changes in saliva from 113 male police officers. Based on repeated administrations of the Karasek Demand/Control and Effort/Reward Imbalance questionnaires, subjects were stratified by perceived stress response (SR) to homogeneous occupational stressors into low, intermediate, or high responders. Salivary miRNA profiles were analyzed using small RNA sequencing, and microbiome composition was assessed through shotgun metagenomics. Eighteen miRNAs were significantly differentially expressed between high- and low-SR groups, with four miRNAs with increasing (miR-10400-5p, miR-1290, miR-6074-5p, and miR-9902) and fourteen with decreasing (including miR-21-5p and mirR-142-3p) levels in the high SR group (adj.p<0.05). The identified salivary miRNAs showed a progressive alteration from low- to high-SR groups. Functional enrichment analysis indicated that dysregulated miRNA targets are involved in apoptosis, cellular stress responses, and metabolic regulation. Distinct salivary microbial communities were also observed across SR groups. Several taxa displayed progressive abundance shifts, with Prevotella baroniae and Schaalia odontolytica increasing and Actinomyces naeslundii and Capnocytophaga ochracea decreasing in the high SR group. Functional predictions revealed, in this group, a significant enrichment of inositol degradation pathways, paralleled by a reduction in bacteria involved in L-tryptophan and thiamine biosynthesis. These findings suggest that salivary miRNAs and microbiota profiles may serve as non-invasive biomarkers of psychosocial stress and provide insight into molecular mechanisms linking chronic stress to physiological and behavioral outcomes.

14
Chromosome-level genome assembly of macroalgae Gracilariopsis lemaneiformis

Hu, Y.; Huang, Y.; Yong, Y.; Shang, E.; Zhang, B.; Sui, Z.

2026-04-30 genomics 10.64898/2026.04.28.721235 medRxiv
Top 0.4%
0.7%
Show abstract

As an important cultivated red alga, Gracilariopsis lemaneiformis has great economic and ecological value. However, its existing genome assembly is highly fragmented and inadequately annotated. In this study, we constructed the first high-quality chromosome-level genome of Gp. lemaneiformis using PacBio long reads, Illumina short reads and Hi-C sequencing data. The assembled genome was approximately 86.66 Mb and the assembled sequences were anchored to 28 pseudo-chromosomes with lengths ranging from 1.70 to 7.81 Mb. 99.91% of the PacBio reads could be mapped to our assembly. In total, 8,664 genes were annotated, and the repeat elements identified in Gp. lemaneiformis constituted 65.04% of the whole genome, including 2.24% tandem repeat sequences and 62.81% interspersed repeats. We also established a high-evidence phylogenetic tree from 19 representative algae species, with the main aim to calculate their divergence times. This high-quality genome of Gp. lemaneiformis provides a crucial foundation for understanding genetic characteristics, investigating the genomic evolution, and facilitating molecular breeding.

15
Genomic indicators of gene function: A systematic assessment of the human genome

Cooper, H. B.; Rojas Lopez, K. E.; Schiavinato, D.; Black, M. A.; Gardner, P. P.

2026-04-09 genomics 10.64898/2026.04.08.717348 medRxiv
Top 0.4%
0.7%
Show abstract

Proteins and non-coding RNAs are functional products of the genome that are central for crucial cellular processes. With recent technological advances, researchers can sequence genomes in the thousands and probe numerous genomic activities of many species and conditions. Such studies have identified thousands of potential proteins, RNAs and associated activities. However there are conflicting interpretations of the results and therefore which regions of the genome are "functional". Here we investigate the relative strengths of associations between coding and non-coding gene functionality and genomic features, by comparing reliably annotated functional genes to non-genic regions of the genome. We find that the strongest and most consistent association between functional genes and genomic features are transcriptional activity and evolutionary conservation. We also evaluated sequence-based statistics, genomic repeats, epigenetic and population variation data. Other features strongly associated with function include histone marks, chromatin accessibility, genomic copy-number, and sequence alignment statistics such as coding potential and covariation. We also identify potential issues with SNP annotations in short non-coding RNAs, as some highly conserved ncRNAs have significantly higher than expected SNP densities. Our results demonstrate the importance of evolutionary conservation and transcription activity for indicating protein-coding and non-coding gene function. Both should be taken into consideration when differentiating between functional sequences and biological or experimental noise.

16
A historical specimen of False Lingzhi (Ganoderma lucidum) resolves a 245-year-old confusion within an important medicinal mushroom group

Varga, T.; Parker, S. R.; Agorini, A.; Dombrowski, A.; Hadfield, L.; Ainsworth, A. M.; Hawksworth, D. L.; Ghobad-Nejhad, M.; Papp, V.

2026-05-14 evolutionary biology 10.64898/2026.05.13.724775 medRxiv
Top 0.4%
0.7%
Show abstract

O_LIPlants and fungi are major sources of natural products beneficial to society, making the study of distinct species essential for discovering new drugs and bioactive compounds. The medicinal mushroom "Lingzhi" or "Reishi" (Ganoderma lingzhi) is widely used in traditional medicine and extensively studied for its bioactive triterpenoids, yet it is commonly identified as Ganoderma lucidum, the type species of the genus, which lacks a type specimen. C_LIO_LIWe sequenced a G. lucidum specimen preserved in the Kew fungarium, which agreed with the original description and was collected from wood of Corylus avellana in southern England. Using this reference specimen, we compiled genomic and ITS barcoding datasets to explore the genetic and geographic variation within this species. C_LIO_LIWe showed that G. lingzhi and G. lucidum diverged more than 12 million years ago and that all seven "G. lucidum" genomes deposited in public databases belong to other species. More than 1000 barcoding sequences showed that the widely used homology-based ITS barcoding is not working in this group, which can be mitigated by a phylogenetic placement approach. The 149 sequences assigned to G. lucidum with high confidence showed a Eurasian distribution and introductions to North and South America and Africa. C_LIO_LIOur study underscores the importance of accurate species identification and provides guidance for a group of pharmaceutical and socially significant species. To further support future studies and the wider public in differentiating between G. lingzhi and G. lucidum, we propose using "False Lingzhi" as the English name for G. lucidum. C_LI Societal Impact StatementTraditional Chinese Medicine has expanded far beyond Asia, with growing markets in North America and Europe for supplements and functional foods. Lingzhi or Reishi (Ganoderma lingzhi), a well-known medicinal mushroom, is valued for its anti-inflammatory and anticancer properties. However, it is often misidentified with species that may not provide the same health benefits. This confusion poses risks to consumer safety, product regulation, and research. Here, we establish a reference using morphological and molecular tools for the most commonly misidentified species (Ganoderma lucidum) and propose the name "False Lingzhi" to distinguish it, supporting accurate identification, safer product development, and reliable research.

17
Comprehensive bioinformatic analysis reveals novel potential diagnostic biomarkers associated with monocytes in osteoporosis

Qin, X.; Wen, B.; He, P.; Chen, Z.; Tan, S.; Mao, Z.

2026-03-24 genetics 10.64898/2026.03.20.713320 medRxiv
Top 0.5%
0.7%
Show abstract

Osteoporosis affects millions of women globally. In this study, we applied bioinformatics methods to screen for novel diagnostic biomarkers of osteoporosis in women using the GSE62402 and GSE56814 datasets. PCSK5, ZNF225, and H1FX were used to construct a diagnostic model. ROC, calibration, and decision curve analyses were performed to assess the diagnostic performance on the training (GSE56814) and external (GSE56815) datasets. The expression level of model genes was validated in GEO datasets. Furthermore, five transcription factors (ETS1, NOTCH1, MAZ, ERG, and FLI1) were identified as common upstream regulators of model genes. PCSK5, ZNF225, and H1FX serve as novel diagnostic biomarkers, providing new insights into the pathogenesis of and treatment strategies for osteoporosis in women.

18
The dynamics of piRNA expression in Blattella germanica ovaries

Pujal, D.; Ylla, G.; Bau, J.; Piulachs, M.-D.

2026-04-06 developmental biology 10.64898/2026.04.02.716027 medRxiv
Top 0.5%
0.6%
Show abstract

The cockroach Blattella germanica possesses panoistic ovaries, in which oocytes lack nurse cells and therefore need to rely on their own transcriptional activity to support embryogenesis. Ovarian development in this species involves the development of a single basal ovarian follicle (BOF) per gonadotropic cycle, a process strictly regulated by endocrine signals, primarily juvenile hormone and ecdysone, which act at both the transcriptional and translational levels. In addition, transcriptional activity in these ovaries is necessary for both regulating and genome protection, and at this level, PIWI-interacting RNAs (piRNAs) play an essential role. Although insect ovaries are known to be particularly rich in piRNAs, their function in ovary maturation is still not well defined. For this purpose, we characterize the piRNA expression dynamics across seven key developmental and reproductive stages, ranging from late nymphal instars to post-vitellogenic adults. piRNA expression in B. germanica shows coordinated fluctuations. Expression remains stable in previtellogenic ovaries, whereas vitellogenic ovaries show pronounced changes. Moreover, vitellogenic ovaries exhibit reduced piRNA diversity due to strong enrichment of a subset of highly expressed piRNAs. Our data show that although piRNAs predominantly map to transposable elements, particularly LINEs, there is a notable increase in gene-derived piRNAs toward the end of the cycle. Our results suggest regulatory roles of piRNAs in modulating both TEs and mRNAs during BOF maturation, likely related to changes in the follicular cell program.

19
Single-nucleus multiome sequencing identifies candidate regulators of mouse gastric epithelial homeostasis

Monteiro de Barros, M. R.; Bosch, K.; Soualhi, S.; Issa Bhaloo, S.; Chu, T.; Hemrajani, T.; Cho, J.; Ozuner, K.; Fu, R.; Geiger, H.; Robine, N.; Carter, J. E. B.; Maniatis, S.; Ryeom, S.; Tavare, S.; Nowicki-Osuch, K.

2026-04-27 genomics 10.64898/2026.04.23.720450 medRxiv
Top 0.5%
0.6%
Show abstract

Background & AimsGastric epithelial cells maintain homeostasis through dynamic self-renewal mechanisms involving stem and progenitor cells; however, identifying them has been challenging. This study aims to identify stem cells of healthy gastric epithelium and cell type-specific regulators defining gastric epithelial homeostasis via single-nucleus multiome analysis. MethodsTen unique gastric samples were collected from 8-12 week old wildtype mice. Isolated nuclei were subjected to simultaneous profiling of gene expression and chromatin accessibility. After quality control, 31,598 cells were analyzed with Seurat and Signac using weighted-nearest neighbors analysis for joint RNA and ATAC clustering. Furthermore, SCENIC+, MultiVelo, EpiCHAOS and Cell plasticity score were used to uncover gene regulatory networks, cell state dynamics and lineage trajectories. ResultsOur analyses were validated by the identification of known regulators of stem-cell differentiation into mature cell types. More importantly, it revealed previously uncharacterized regulatory networks comprising novel transcription factor combinations that define cell identities, including Ppara, Pparg, Arid5b and Sox5 as candidate regulators of parietal, foveolar, chief and neck cells, respectively. Further, our data support the identity of isthmus cells as stem-like cells of healthy gastric epithelium, as evidenced by epigenetic plasticity that simultaneously contains open chromatin states of all differentiated cell types in the absence of transcriptional reprogramming. ConclusionConsistent with Waddingtons epigenetic landscape hypothesis, gastric epithelial homeostasis is controlled by orchestrated epigenetic and transcriptional programs. Contrary to the prevailing hypothesis, stem cells can be defined not by a separate epigenetic state but by epigenetic superposition of differentiated cell states. Future work is needed to define the universality of these results.

20
Transcriptomic profiling of embryo-derived cell lines from the Chagas disease insect vector Rhodnius prolixus

de Andrade Tavares, L.; Garcia, A. C.; Bell-Sakyi, L.; Fontenele de Brito, T.; Pane, A.

2026-05-12 genetics 10.64898/2026.05.08.723764 medRxiv
Top 0.5%
0.6%
Show abstract

Rhodnius prolixus is a primary insect vector of Trypanosoma cruzi, the causative agent of Chagas disease, a neglected parasitosis endemic in Latin American countries. It has been estimated that Chagas disease affects 7-8 million people worldwide and is responsible for approximately 1000 deaths per year. Genetic and molecular studies in this species remain challenging due to its life cycle and feeding habits, thus hindering the development of new strategies to control their populations and reduce the diffusion of Chagas disease. Recently, two stable cell lines - RPE/LULS53 and RPE/LULS57 - were derived from Rhodnius embryos, which represent promising new tools to investigate the genetics of this insect vector. Here, we describe their gene expression landscapes through transcriptomic approaches. We show that 8,968 expressed genes are shared between the two cell lines, whereas 391 and 1,088 genes are uniquely expressed in RPE/LULS53 and RPE/LULS57, respectively. Although key components of primary developmental, immune and redox signaling pathways are expressed in both cell lines, some genes such as Frizzled-10-a-like and catalase show marked differences in expression. Our results strongly suggest that RPE/LULS53 and RPE/LULS57 likely represent two different cell phenotypes. Consistent with this, gene ontology analysis reveals that RPE/LULS53 is enriched for animal organ morphogenesis and stress response, while RPE/LULS57 for DNA-directed RNA polymerase activity, among others. Despite these differences, both cell lines express comparable levels of transcripts from resident transposable elements, including the highly abundant Mariner and LINE/I elements, as well as horizontally transferred transposons. Our findings shed light on the nature of the RPE/LULS53 and RPE/LULS57 embryo-derived cell lines and provide valuable transcriptomic resources for future genetic and functional studies in Rhodnius and other triatomine insect vectors. Author summaryRhodnius prolixus is a blood-feeding insect and a major vector of Chagas disease, a parasitosis endemic in Latin America and affecting millions of people worldwide. In the absence of effective drugs and vaccines, the control of the insect population represents a promising strategy to reduce the diffusion of the disease. Yet, genetic and functional studies in Rhodnius are extremely challenging due to its feeding habit and life cycle. To overcome these limitations, researchers have previously developed two stable cell lines derived from Rhodnius embryos. In this study, we provide the first characterization of the genes expressed in these cell lines. We found that, while the two cell lines share many expressed genes, each of them also has distinct gene expression patterns pointing to two different cell types with specialized functions. These differences likely affect the way they respond to stress and regulate biological processes. Our findings provide an important resource for researchers studying Rhodnius prolixus and other insect vectors, helping advance our understanding of the genetic and molecular mechanisms that control the insect development and mediate the interactions between insect vectors and the parasites they transmit